منابع مشابه
Clasificación de Páginas Web con Anotaciones Sociales
User-generated annotations on social bookmarking sites can provide interesting and promising metadata for web page classification. These annotations include diverse types of information, such as tags and comments. Nonetheless, each kind of annotation has a different nature and popularity level. In this work, we analyze and evaluate the usefulness of each of these social annotations to classify ...
متن کاملDAWEB: Un descargador y analizador morfológico de páginas web
DAWeb is a computer application developed as part of a project oriented to produce tools designed to get at the big flow of linguistic information of Internet documents. It is a tool for morphosyntactic analysis of great volumes of information —whole domains— reached by its URLs. The simple application interfaz facilitates the configururation of how to accessing and analysing the information ob...
متن کاملDescoberta de ruído em páginas da web oculta através de uma abordagem de aprendizagem supervisionada
Um dos problemas da extração de dados na web é a remoção de ruídos existentes nas páginas. Esta tarefa busca identi car todos os elementos não informativos em meio ao conteúdo, como por exemplo cabeçalhos, menus ou propagandas. A presença de ruídos pode prejudicar seriamente o desempenho de motores de busca e tarefas de mineração de dados na web. Este trabalho aborda o problema da descoberta de...
متن کاملEvaluación del clustering de páginas web mediante funciones de peso y combinación heurística de criterios
Web page clustering can help in the evaluation and search of the results of search engines, among other things. The different term weighting functions applied to the selected features to represent web pages is a main aspect in clustering task. In this paper, seven different term weighting functions are evaluated by means of the results of a partitioning clustering algorithm, with a reference we...
متن کاملComparativa de Aproximaciones a SVM Semisupervisado Multiclase para Clasificación de Páginas Web
In this paper we present a study for semi-supervised multiclass web page classification using SVM. We propose not only combining binary semi-supervised classifiers, but also multiclass supervised ones. Our experiments show great performance for the latter method, where ignoring unlabeled documents could be better for some cases, using only labeled documents for the learning task, directly based...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Programming Historian em português
سال: 2021
ISSN: 2753-9296
DOI: 10.46430/phpt0002